Overview

Dataset info

Number of variables11
Number of observations30181
Missing cells0 (0.0%)
Duplicate rows0 (0.0%)
Total size in memory1.4 MiB
Average record size in memory48.7 B

Variables types

Numeric5
Categorical6
Boolean0
Date0
URL0
Text (Unique)0
Rejected0
Unsupported0

Warnings

Event has a high cardinality: 562 distinct values Warning
Games has a high cardinality: 51 distinct values Warning
NOC has a high cardinality: 143 distinct values Warning
Sport has a high cardinality: 55 distinct values Warning

Variables

Age
Numeric

Distinct count50
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean25.42901163
Minimum13
Maximum66
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum13
5-th percentile18
Q122
Median25
Q328
95-th percentile34
Maximum66
Range53
Interquartile range6

Descriptive statistics

Standard deviation5.049684088
Coef of variation0.1985796445
Kurtosis3.058863832
Mean25.42901163
MAD3.839394115
Skewness1.067453639
Sum767473
Variance25.49930939
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[13. 13.5 14.5 15.5 16.5 ... 42.5 46.5 52.5 60.5 66. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
23 2742 9.1%
 
24 2649 8.8%
 
25 2558 8.5%
 
22 2549 8.4%
 
26 2378 7.9%
 
27 2213 7.3%
 
21 2140 7.1%
 
28 1948 6.5%
 
29 1553 5.1%
 
20 1550 5.1%
 
Other values (40) 7901 26.2%
 

Minimum 5 values

ValueCountFrequency (%) 
13 10 < 0.1%
 
14 53 0.2%
 
15 155 0.5%
 
16 284 0.9%
 
17 399 1.3%
 

Maximum 5 values

ValueCountFrequency (%) 
66 1 < 0.1%
 
61 2 < 0.1%
 
60 3 < 0.1%
 
59 1 < 0.1%
 
58 3 < 0.1%
 

df_index
Numeric

Distinct count30181
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean139517.8801
Minimum40
Maximum271103
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum40
5-th percentile15556
Q173782
Median138892
Q3207507
95-th percentile259743
Maximum271103
Range271063
Interquartile range133725

Descriptive statistics

Standard deviation77913.35373
Coef of variation0.5584470869
Kurtosis-1.177238993
Mean139517.8801
MAD67250.11654
Skewness-0.03426245443
Sum4210789140
Variance6070490690
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[4.000000e+01 2.275000e+02 5.885000e+02 3.146000e+03 3.834500e+03 ... 2.546490e+05 2.547105e+05 2.648795e+05 2.649410e+05 2.711030e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
266238 1 < 0.1%
 
36443 1 < 0.1%
 
26182 1 < 0.1%
 
202312 1 < 0.1%
 
76289 1 < 0.1%
 
212557 1 < 0.1%
 
120400 1 < 0.1%
 
181842 1 < 0.1%
 
249427 1 < 0.1%
 
194132 1 < 0.1%
 
Other values (30171) 30171 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
40 1 < 0.1%
 
41 1 < 0.1%
 
42 1 < 0.1%
 
44 1 < 0.1%
 
48 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
271103 1 < 0.1%
 
271102 1 < 0.1%
 
271082 1 < 0.1%
 
271080 1 < 0.1%
 
271078 1 < 0.1%
 

Event
Categorical

Distinct count562
Unique (%)1.9%
Missing (%)0.0%
Missing (n)0
Ice Hockey Men's Ice Hockey
 
1001
Football Men's Football
 
783
Hockey Men's Hockey
 
714
Other values (559)
27683
ValueCountFrequency (%) 
Ice Hockey Men's Ice Hockey 1001 3.3%
 
Football Men's Football 783 2.6%
 
Hockey Men's Hockey 714 2.4%
 
Basketball Men's Basketball 610 2.0%
 
Water Polo Men's Water Polo 573 1.9%
 
Handball Men's Handball 516 1.7%
 
Volleyball Men's Volleyball 489 1.6%
 
Volleyball Women's Volleyball 469 1.6%
 
Hockey Women's Hockey 454 1.5%
 
Rowing Men's Coxed Eights 446 1.5%
 
Other values (552) 24126 79.9%
 
Max length61
Mean length31.1856466
Min length17
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Games
Categorical

Distinct count51
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
2008 Summer
 
2035
2016 Summer
 
2014
2004 Summer
 
2000
Other values (48)
24132
ValueCountFrequency (%) 
2008 Summer 2035 6.7%
 
2016 Summer 2014 6.7%
 
2004 Summer 2000 6.6%
 
2000 Summer 1993 6.6%
 
2012 Summer 1915 6.3%
 
1996 Summer 1717 5.7%
 
1988 Summer 1576 5.2%
 
1992 Summer 1523 5.0%
 
1984 Summer 1463 4.8%
 
1980 Summer 1377 4.6%
 
Other values (41) 12568 41.6%
 
Max length11
Mean length11
Min length11
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Height
Numeric

Distinct count86
Unique (%)0.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean177.6423578
Minimum136
Maximum223
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum136
5-th percentile160
Q1170
Median178
Q3185
95-th percentile196
Maximum223
Range87
Interquartile range15

Descriptive statistics

Standard deviation10.92418844
Coef of variation0.0614954033
Kurtosis0.1539512675
Mean177.6423578
MAD8.702072715
Skewness0.04199807524
Sum5361424
Variance119.337893
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[136. 144.5 149.5 150.5 151.5 ... 205.5 208.5 211.5 218.5 223. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
180 1715 5.7%
 
178 1531 5.1%
 
170 1469 4.9%
 
183 1412 4.7%
 
175 1379 4.6%
 
185 1178 3.9%
 
173 1090 3.6%
 
172 1025 3.4%
 
182 942 3.1%
 
168 930 3.1%
 
Other values (76) 17510 58.0%
 

Minimum 5 values

ValueCountFrequency (%) 
136 5 < 0.1%
 
137 2 < 0.1%
 
138 1 < 0.1%
 
139 4 < 0.1%
 
140 6 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
223 4 < 0.1%
 
220 3 < 0.1%
 
219 1 < 0.1%
 
218 6 < 0.1%
 
217 4 < 0.1%
 

Medal
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Gold
10167
Bronze
10148
Silver
9866
ValueCountFrequency (%) 
Gold 10167 33.7%
 
Bronze 10148 33.6%
 
Silver 9866 32.7%
 
Max length6
Mean length5.326264869
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

NOC
Categorical

Distinct count143
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
USA
 
4383
URS
 
2246
GER
 
1612
Other values (140)
21940
ValueCountFrequency (%) 
USA 4383 14.5%
 
URS 2246 7.4%
 
GER 1612 5.3%
 
AUS 1206 4.0%
 
RUS 1134 3.8%
 
ITA 1060 3.5%
 
CAN 1060 3.5%
 
GBR 1031 3.4%
 
GDR 995 3.3%
 
FRA 987 3.3%
 
Other values (133) 14467 47.9%
 
Max length3
Mean length3
Min length3
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Sex
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
M
19831
F
10350
ValueCountFrequency (%) 
M 19831 65.7%
 
F 10350 34.3%
 
Max length1
Mean length1
Min length1
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Sport
Categorical

Distinct count55
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
Athletics
 
3648
Swimming
 
2486
Rowing
 
2104
Other values (52)
21943
ValueCountFrequency (%) 
Athletics 3648 12.1%
 
Swimming 2486 8.2%
 
Rowing 2104 7.0%
 
Ice Hockey 1301 4.3%
 
Hockey 1168 3.9%
 
Gymnastics 1161 3.8%
 
Fencing 1109 3.7%
 
Football 1084 3.6%
 
Canoeing 1041 3.4%
 
Basketball 1000 3.3%
 
Other values (45) 14079 46.6%
 
Max length25
Mean length9.165037606
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesTrue
Contains non-wordsTrue

Weight
Numeric

Distinct count129
Unique (%)0.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean73.75033962
Minimum28
Maximum182
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum28
5-th percentile52
Q163
Median73
Q383
95-th percentile100
Maximum182
Range154
Interquartile range20

Descriptive statistics

Standard deviation15.00432874
Coef of variation0.2034475884
Kurtosis1.550700274
Mean73.75033962
MAD11.70538571
Skewness0.689245348
Sum2225859
Variance225.129881
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 28. 32.5 38.5 41.5 46.5 ... 125.5 129.5 130.5 146.5 182. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
70 1258 4.2%
 
75 1173 3.9%
 
68 949 3.1%
 
80 930 3.1%
 
73 911 3.0%
 
60 908 3.0%
 
72 892 3.0%
 
65 852 2.8%
 
78 776 2.6%
 
64 763 2.5%
 
Other values (119) 20769 68.8%
 

Minimum 5 values

ValueCountFrequency (%) 
28 2 < 0.1%
 
30 5 < 0.1%
 
31 1 < 0.1%
 
32 3 < 0.1%
 
33 9 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
182 1 < 0.1%
 
175 1 < 0.1%
 
170 2 < 0.1%
 
167 1 < 0.1%
 
163 2 < 0.1%
 

Year
Numeric

Distinct count35
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1988.005964
Minimum1896
Maximum2016
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1896
5-th percentile1948
Q11976
Median1992
Q32006
95-th percentile2016
Maximum2016
Range120
Interquartile range30

Descriptive statistics

Standard deviation22.71845057
Coef of variation0.01142775775
Kurtosis1.693381115
Mean1988.005964
MAD17.79612648
Skewness-1.197529559
Sum60000008
Variance516.1279962
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=35)
Histogram
Histogram with variable size bins (bins=[1896. 1902. 1905. 1910. 1922. ... 2009. 2011. 2013. 2015. 2016.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2008 2035 6.7%
 
2016 2014 6.7%
 
2004 2000 6.6%
 
2000 1993 6.6%
 
2012 1915 6.3%
 
1992 1834 6.1%
 
1988 1827 6.1%
 
1996 1717 5.7%
 
1984 1683 5.6%
 
1980 1572 5.2%
 
Other values (25) 11591 38.4%
 

Minimum 5 values

ValueCountFrequency (%) 
1896 20 0.1%
 
1900 38 0.1%
 
1904 59 0.2%
 
1906 69 0.2%
 
1908 134 0.4%
 

Maximum 5 values

ValueCountFrequency (%) 
2016 2014 6.7%
 
2014 570 1.9%
 
2012 1915 6.3%
 
2010 515 1.7%
 
2008 2035 6.7%
 

Correlations

Missing values

Sample

First rows

Agedf_indexEventGamesHeightMedalNOCSexSportWeightYear
021125750Ice Hockey Men's Ice Hockey1992 Winter177GoldEUNMIce Hockey801992
12847707Water Polo Men's Water Polo1992 Summer180GoldITAMWater Polo721992
217247453Athletics Women's 4 x 100 metres Relay1972 Summer167BronzeCUBFAthletics541972
32596724Rowing Men's Coxed Eights1972 Summer191SilverUSAMRowing901972
4296158Rowing Men's Coxed Eights1964 Summer186GoldUSAMRowing911964
525191781Handball Men's Handball2016 Summer190SilverFRAMHandball922016
621147661Athletics Men's 4 x 400 metres Relay1980 Summer180BronzeITAMAthletics731980
726157511Canoeing Women's Kayak Fours, 500 metres1992 Summer176GoldHUNFCanoeing661992
821122955Wrestling Men's Super-Heavyweight, Greco-Roman1976 Summer193GoldURSMWrestling1151976
922159127Athletics Men's Javelin Throw1952 Summer179SilverUSAMAthletics771952

Last rows

Agedf_indexEventGamesHeightMedalNOCSexSportWeightYear
3017128240574Gymnastics Men's Horizontal Bar1964 Summer170SilverURSMGymnastics701964
301721776428Athletics Women's 4 x 100 metres Relay1984 Summer167SilverCANFAthletics591984
301731980162Athletics Men's 4 x 100 metres Relay1976 Summer173GoldUSAMAthletics671976
3017428270111Volleyball Women's Volleyball2016 Summer186SilverSRBFVolleyball722016
3017528161542Fencing Women's Foil, Team1984 Summer169SilverROUFFencing581984
301762318381Cross Country Skiing Women's 5/10 kilometres Pursuit1992 Winter158SilverITAFCross Country Skiing451992
3017724141646Fencing Women's epee, Team2004 Summer174GoldRUSFFencing622004
301782398983Fencing Men's Sabre, Individual1960 Summer183SilverHUNMFencing751960
3017926192528Fencing Men's Sabre, Team2000 Summer178GoldRUSMFencing782000
3018023130584Handball Men's Handball2004 Summer196GoldCROMHandball992004
Pandas Profiling Report

Overview

Dataset info

Number of variables11
Number of observations30181
Missing cells0 (0.0%)
Duplicate rows0 (0.0%)
Total size in memory1.4 MiB
Average record size in memory48.7 B

Variables types

Numeric5
Categorical6
Boolean0
Date0
URL0
Text (Unique)0
Rejected0
Unsupported0

Warnings

Event has a high cardinality: 562 distinct values Warning
Games has a high cardinality: 51 distinct values Warning
NOC has a high cardinality: 143 distinct values Warning
Sport has a high cardinality: 55 distinct values Warning

Variables

Age
Numeric

Distinct count50
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean25.42901163
Minimum13
Maximum66
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum13
5-th percentile18
Q122
Median25
Q328
95-th percentile34
Maximum66
Range53
Interquartile range6

Descriptive statistics

Standard deviation5.049684088
Coef of variation0.1985796445
Kurtosis3.058863832
Mean25.42901163
MAD3.839394115
Skewness1.067453639
Sum767473
Variance25.49930939
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[13. 13.5 14.5 15.5 16.5 ... 42.5 46.5 52.5 60.5 66. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
23 2742 9.1%
 
24 2649 8.8%
 
25 2558 8.5%
 
22 2549 8.4%
 
26 2378 7.9%
 
27 2213 7.3%
 
21 2140 7.1%
 
28 1948 6.5%
 
29 1553 5.1%
 
20 1550 5.1%
 
Other values (40) 7901 26.2%
 

Minimum 5 values

ValueCountFrequency (%) 
13 10 < 0.1%
 
14 53 0.2%
 
15 155 0.5%
 
16 284 0.9%
 
17 399 1.3%
 

Maximum 5 values

ValueCountFrequency (%) 
66 1 < 0.1%
 
61 2 < 0.1%
 
60 3 < 0.1%
 
59 1 < 0.1%
 
58 3 < 0.1%
 

df_index
Numeric

Distinct count30181
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean139517.8801
Minimum40
Maximum271103
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum40
5-th percentile15556
Q173782
Median138892
Q3207507
95-th percentile259743
Maximum271103
Range271063
Interquartile range133725

Descriptive statistics

Standard deviation77913.35373
Coef of variation0.5584470869
Kurtosis-1.177238993
Mean139517.8801
MAD67250.11654
Skewness-0.03426245443
Sum4210789140
Variance6070490690
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[4.000000e+01 2.275000e+02 5.885000e+02 3.146000e+03 3.834500e+03 ... 2.546490e+05 2.547105e+05 2.648795e+05 2.649410e+05 2.711030e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
266238 1 < 0.1%
 
36443 1 < 0.1%
 
26182 1 < 0.1%
 
202312 1 < 0.1%
 
76289 1 < 0.1%
 
212557 1 < 0.1%
 
120400 1 < 0.1%
 
181842 1 < 0.1%
 
249427 1 < 0.1%
 
194132 1 < 0.1%
 
Other values (30171) 30171 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
40 1 < 0.1%
 
41 1 < 0.1%
 
42 1 < 0.1%
 
44 1 < 0.1%
 
48 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
271103 1 < 0.1%
 
271102 1 < 0.1%
 
271082 1 < 0.1%
 
271080 1 < 0.1%
 
271078 1 < 0.1%
 

Event
Categorical

Distinct count562
Unique (%)1.9%
Missing (%)0.0%
Missing (n)0
Ice Hockey Men's Ice Hockey
 
1001
Football Men's Football
 
783
Hockey Men's Hockey
 
714
Other values (559)
27683
ValueCountFrequency (%) 
Ice Hockey Men's Ice Hockey 1001 3.3%
 
Football Men's Football 783 2.6%
 
Hockey Men's Hockey 714 2.4%
 
Basketball Men's Basketball 610 2.0%
 
Water Polo Men's Water Polo 573 1.9%
 
Handball Men's Handball 516 1.7%
 
Volleyball Men's Volleyball 489 1.6%
 
Volleyball Women's Volleyball 469 1.6%
 
Hockey Women's Hockey 454 1.5%
 
Rowing Men's Coxed Eights 446 1.5%
 
Other values (552) 24126 79.9%
 
Max length61
Mean length31.1856466
Min length17
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Games
Categorical

Distinct count51
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
2008 Summer
 
2035
2016 Summer
 
2014
2004 Summer
 
2000
Other values (48)
24132
ValueCountFrequency (%) 
2008 Summer 2035 6.7%
 
2016 Summer 2014 6.7%
 
2004 Summer 2000 6.6%
 
2000 Summer 1993 6.6%
 
2012 Summer 1915 6.3%
 
1996 Summer 1717 5.7%
 
1988 Summer 1576 5.2%
 
1992 Summer 1523 5.0%
 
1984 Summer 1463 4.8%
 
1980 Summer 1377 4.6%
 
Other values (41) 12568 41.6%
 
Max length11
Mean length11
Min length11
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Height
Numeric

Distinct count86
Unique (%)0.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean177.6423578
Minimum136
Maximum223
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum136
5-th percentile160
Q1170
Median178
Q3185
95-th percentile196
Maximum223
Range87
Interquartile range15

Descriptive statistics

Standard deviation10.92418844
Coef of variation0.0614954033
Kurtosis0.1539512675
Mean177.6423578
MAD8.702072715
Skewness0.04199807524
Sum5361424
Variance119.337893
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[136. 144.5 149.5 150.5 151.5 ... 205.5 208.5 211.5 218.5 223. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
180 1715 5.7%
 
178 1531 5.1%
 
170 1469 4.9%
 
183 1412 4.7%
 
175 1379 4.6%
 
185 1178 3.9%
 
173 1090 3.6%
 
172 1025 3.4%
 
182 942 3.1%
 
168 930 3.1%
 
Other values (76) 17510 58.0%
 

Minimum 5 values

ValueCountFrequency (%) 
136 5 < 0.1%
 
137 2 < 0.1%
 
138 1 < 0.1%
 
139 4 < 0.1%
 
140 6 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
223 4 < 0.1%
 
220 3 < 0.1%
 
219 1 < 0.1%
 
218 6 < 0.1%
 
217 4 < 0.1%
 

Medal
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Gold
10167
Bronze
10148
Silver
9866
ValueCountFrequency (%) 
Gold 10167 33.7%
 
Bronze 10148 33.6%
 
Silver 9866 32.7%
 
Max length6
Mean length5.326264869
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

NOC
Categorical

Distinct count143
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
USA
 
4383
URS
 
2246
GER
 
1612
Other values (140)
21940
ValueCountFrequency (%) 
USA 4383 14.5%
 
URS 2246 7.4%
 
GER 1612 5.3%
 
AUS 1206 4.0%
 
RUS 1134 3.8%
 
ITA 1060 3.5%
 
CAN 1060 3.5%
 
GBR 1031 3.4%
 
GDR 995 3.3%
 
FRA 987 3.3%
 
Other values (133) 14467 47.9%
 
Max length3
Mean length3
Min length3
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Sex
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
M
19831
F
10350
ValueCountFrequency (%) 
M 19831 65.7%
 
F 10350 34.3%
 
Max length1
Mean length1
Min length1
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Sport
Categorical

Distinct count55
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
Athletics
 
3648
Swimming
 
2486
Rowing
 
2104
Other values (52)
21943
ValueCountFrequency (%) 
Athletics 3648 12.1%
 
Swimming 2486 8.2%
 
Rowing 2104 7.0%
 
Ice Hockey 1301 4.3%
 
Hockey 1168 3.9%
 
Gymnastics 1161 3.8%
 
Fencing 1109 3.7%
 
Football 1084 3.6%
 
Canoeing 1041 3.4%
 
Basketball 1000 3.3%
 
Other values (45) 14079 46.6%
 
Max length25
Mean length9.165037606
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesTrue
Contains non-wordsTrue

Weight
Numeric

Distinct count129
Unique (%)0.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean73.75033962
Minimum28
Maximum182
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum28
5-th percentile52
Q163
Median73
Q383
95-th percentile100
Maximum182
Range154
Interquartile range20

Descriptive statistics

Standard deviation15.00432874
Coef of variation0.2034475884
Kurtosis1.550700274
Mean73.75033962
MAD11.70538571
Skewness0.689245348
Sum2225859
Variance225.129881
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 28. 32.5 38.5 41.5 46.5 ... 125.5 129.5 130.5 146.5 182. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
70 1258 4.2%
 
75 1173 3.9%
 
68 949 3.1%
 
80 930 3.1%
 
73 911 3.0%
 
60 908 3.0%
 
72 892 3.0%
 
65 852 2.8%
 
78 776 2.6%
 
64 763 2.5%
 
Other values (119) 20769 68.8%
 

Minimum 5 values

ValueCountFrequency (%) 
28 2 < 0.1%
 
30 5 < 0.1%
 
31 1 < 0.1%
 
32 3 < 0.1%
 
33 9 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
182 1 < 0.1%
 
175 1 < 0.1%
 
170 2 < 0.1%
 
167 1 < 0.1%
 
163 2 < 0.1%
 

Year
Numeric

Distinct count35
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1988.005964
Minimum1896
Maximum2016
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1896
5-th percentile1948
Q11976
Median1992
Q32006
95-th percentile2016
Maximum2016
Range120
Interquartile range30

Descriptive statistics

Standard deviation22.71845057
Coef of variation0.01142775775
Kurtosis1.693381115
Mean1988.005964
MAD17.79612648
Skewness-1.197529559
Sum60000008
Variance516.1279962
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=35)
Histogram
Histogram with variable size bins (bins=[1896. 1902. 1905. 1910. 1922. ... 2009. 2011. 2013. 2015. 2016.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2008 2035 6.7%
 
2016 2014 6.7%
 
2004 2000 6.6%
 
2000 1993 6.6%
 
2012 1915 6.3%
 
1992 1834 6.1%
 
1988 1827 6.1%
 
1996 1717 5.7%
 
1984 1683 5.6%
 
1980 1572 5.2%
 
Other values (25) 11591 38.4%
 

Minimum 5 values

ValueCountFrequency (%) 
1896 20 0.1%
 
1900 38 0.1%
 
1904 59 0.2%
 
1906 69 0.2%
 
1908 134 0.4%
 

Maximum 5 values

ValueCountFrequency (%) 
2016 2014 6.7%
 
2014 570 1.9%
 
2012 1915 6.3%
 
2010 515 1.7%
 
2008 2035 6.7%
 

Correlations

Missing values

Sample

First rows

Agedf_indexEventGamesHeightMedalNOCSexSportWeightYear
021125750Ice Hockey Men's Ice Hockey1992 Winter177GoldEUNMIce Hockey801992
12847707Water Polo Men's Water Polo1992 Summer180GoldITAMWater Polo721992
217247453Athletics Women's 4 x 100 metres Relay1972 Summer167BronzeCUBFAthletics541972
32596724Rowing Men's Coxed Eights1972 Summer191SilverUSAMRowing901972
4296158Rowing Men's Coxed Eights1964 Summer186GoldUSAMRowing911964
525191781Handball Men's Handball2016 Summer190SilverFRAMHandball922016
621147661Athletics Men's 4 x 400 metres Relay1980 Summer180BronzeITAMAthletics731980
726157511Canoeing Women's Kayak Fours, 500 metres1992 Summer176GoldHUNFCanoeing661992
821122955Wrestling Men's Super-Heavyweight, Greco-Roman1976 Summer193GoldURSMWrestling1151976
922159127Athletics Men's Javelin Throw1952 Summer179SilverUSAMAthletics771952

Last rows

Agedf_indexEventGamesHeightMedalNOCSexSportWeightYear
3017128240574Gymnastics Men's Horizontal Bar1964 Summer170SilverURSMGymnastics701964
301721776428Athletics Women's 4 x 100 metres Relay1984 Summer167SilverCANFAthletics591984
301731980162Athletics Men's 4 x 100 metres Relay1976 Summer173GoldUSAMAthletics671976
3017428270111Volleyball Women's Volleyball2016 Summer186SilverSRBFVolleyball722016
3017528161542Fencing Women's Foil, Team1984 Summer169SilverROUFFencing581984
301762318381Cross Country Skiing Women's 5/10 kilometres Pursuit1992 Winter158SilverITAFCross Country Skiing451992
3017724141646Fencing Women's epee, Team2004 Summer174GoldRUSFFencing622004
301782398983Fencing Men's Sabre, Individual1960 Summer183SilverHUNMFencing751960
3017926192528Fencing Men's Sabre, Team2000 Summer178GoldRUSMFencing782000
3018023130584Handball Men's Handball2004 Summer196GoldCROMHandball992004
Pandas Profiling Report

Overview

Dataset info

Number of variables11
Number of observations30181
Missing cells0 (0.0%)
Duplicate rows0 (0.0%)
Total size in memory1.4 MiB
Average record size in memory48.7 B

Variables types

Numeric5
Categorical6
Boolean0
Date0
URL0
Text (Unique)0
Rejected0
Unsupported0

Warnings

Event has a high cardinality: 562 distinct values Warning
Games has a high cardinality: 51 distinct values Warning
NOC has a high cardinality: 143 distinct values Warning
Sport has a high cardinality: 55 distinct values Warning

Variables

Age
Numeric

Distinct count50
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean25.42901163
Minimum13
Maximum66
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum13
5-th percentile18
Q122
Median25
Q328
95-th percentile34
Maximum66
Range53
Interquartile range6

Descriptive statistics

Standard deviation5.049684088
Coef of variation0.1985796445
Kurtosis3.058863832
Mean25.42901163
MAD3.839394115
Skewness1.067453639
Sum767473
Variance25.49930939
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[13. 13.5 14.5 15.5 16.5 ... 42.5 46.5 52.5 60.5 66. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
23 2742 9.1%
 
24 2649 8.8%
 
25 2558 8.5%
 
22 2549 8.4%
 
26 2378 7.9%
 
27 2213 7.3%
 
21 2140 7.1%
 
28 1948 6.5%
 
29 1553 5.1%
 
20 1550 5.1%
 
Other values (40) 7901 26.2%
 

Minimum 5 values

ValueCountFrequency (%) 
13 10 < 0.1%
 
14 53 0.2%
 
15 155 0.5%
 
16 284 0.9%
 
17 399 1.3%
 

Maximum 5 values

ValueCountFrequency (%) 
66 1 < 0.1%
 
61 2 < 0.1%
 
60 3 < 0.1%
 
59 1 < 0.1%
 
58 3 < 0.1%
 

df_index
Numeric

Distinct count30181
Unique (%)100.0%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean139517.8801
Minimum40
Maximum271103
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum40
5-th percentile15556
Q173782
Median138892
Q3207507
95-th percentile259743
Maximum271103
Range271063
Interquartile range133725

Descriptive statistics

Standard deviation77913.35373
Coef of variation0.5584470869
Kurtosis-1.177238993
Mean139517.8801
MAD67250.11654
Skewness-0.03426245443
Sum4210789140
Variance6070490690
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[4.000000e+01 2.275000e+02 5.885000e+02 3.146000e+03 3.834500e+03 ... 2.546490e+05 2.547105e+05 2.648795e+05 2.649410e+05 2.711030e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
266238 1 < 0.1%
 
36443 1 < 0.1%
 
26182 1 < 0.1%
 
202312 1 < 0.1%
 
76289 1 < 0.1%
 
212557 1 < 0.1%
 
120400 1 < 0.1%
 
181842 1 < 0.1%
 
249427 1 < 0.1%
 
194132 1 < 0.1%
 
Other values (30171) 30171 > 99.9%
 

Minimum 5 values

ValueCountFrequency (%) 
40 1 < 0.1%
 
41 1 < 0.1%
 
42 1 < 0.1%
 
44 1 < 0.1%
 
48 1 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
271103 1 < 0.1%
 
271102 1 < 0.1%
 
271082 1 < 0.1%
 
271080 1 < 0.1%
 
271078 1 < 0.1%
 

Event
Categorical

Distinct count562
Unique (%)1.9%
Missing (%)0.0%
Missing (n)0
Ice Hockey Men's Ice Hockey
 
1001
Football Men's Football
 
783
Hockey Men's Hockey
 
714
Other values (559)
27683
ValueCountFrequency (%) 
Ice Hockey Men's Ice Hockey 1001 3.3%
 
Football Men's Football 783 2.6%
 
Hockey Men's Hockey 714 2.4%
 
Basketball Men's Basketball 610 2.0%
 
Water Polo Men's Water Polo 573 1.9%
 
Handball Men's Handball 516 1.7%
 
Volleyball Men's Volleyball 489 1.6%
 
Volleyball Women's Volleyball 469 1.6%
 
Hockey Women's Hockey 454 1.5%
 
Rowing Men's Coxed Eights 446 1.5%
 
Other values (552) 24126 79.9%
 
Max length61
Mean length31.1856466
Min length17
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Games
Categorical

Distinct count51
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
2008 Summer
 
2035
2016 Summer
 
2014
2004 Summer
 
2000
Other values (48)
24132
ValueCountFrequency (%) 
2008 Summer 2035 6.7%
 
2016 Summer 2014 6.7%
 
2004 Summer 2000 6.6%
 
2000 Summer 1993 6.6%
 
2012 Summer 1915 6.3%
 
1996 Summer 1717 5.7%
 
1988 Summer 1576 5.2%
 
1992 Summer 1523 5.0%
 
1984 Summer 1463 4.8%
 
1980 Summer 1377 4.6%
 
Other values (41) 12568 41.6%
 
Max length11
Mean length11
Min length11
Contains charsTrue
Contains digitsTrue
Contains spacesTrue
Contains non-wordsTrue

Height
Numeric

Distinct count86
Unique (%)0.3%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean177.6423578
Minimum136
Maximum223
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum136
5-th percentile160
Q1170
Median178
Q3185
95-th percentile196
Maximum223
Range87
Interquartile range15

Descriptive statistics

Standard deviation10.92418844
Coef of variation0.0614954033
Kurtosis0.1539512675
Mean177.6423578
MAD8.702072715
Skewness0.04199807524
Sum5361424
Variance119.337893
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[136. 144.5 149.5 150.5 151.5 ... 205.5 208.5 211.5 218.5 223. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
180 1715 5.7%
 
178 1531 5.1%
 
170 1469 4.9%
 
183 1412 4.7%
 
175 1379 4.6%
 
185 1178 3.9%
 
173 1090 3.6%
 
172 1025 3.4%
 
182 942 3.1%
 
168 930 3.1%
 
Other values (76) 17510 58.0%
 

Minimum 5 values

ValueCountFrequency (%) 
136 5 < 0.1%
 
137 2 < 0.1%
 
138 1 < 0.1%
 
139 4 < 0.1%
 
140 6 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
223 4 < 0.1%
 
220 3 < 0.1%
 
219 1 < 0.1%
 
218 6 < 0.1%
 
217 4 < 0.1%
 

Medal
Categorical

Distinct count3
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
Gold
10167
Bronze
10148
Silver
9866
ValueCountFrequency (%) 
Gold 10167 33.7%
 
Bronze 10148 33.6%
 
Silver 9866 32.7%
 
Max length6
Mean length5.326264869
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

NOC
Categorical

Distinct count143
Unique (%)0.5%
Missing (%)0.0%
Missing (n)0
USA
 
4383
URS
 
2246
GER
 
1612
Other values (140)
21940
ValueCountFrequency (%) 
USA 4383 14.5%
 
URS 2246 7.4%
 
GER 1612 5.3%
 
AUS 1206 4.0%
 
RUS 1134 3.8%
 
ITA 1060 3.5%
 
CAN 1060 3.5%
 
GBR 1031 3.4%
 
GDR 995 3.3%
 
FRA 987 3.3%
 
Other values (133) 14467 47.9%
 
Max length3
Mean length3
Min length3
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Sex
Categorical

Distinct count2
Unique (%)< 0.1%
Missing (%)0.0%
Missing (n)0
M
19831
F
10350
ValueCountFrequency (%) 
M 19831 65.7%
 
F 10350 34.3%
 
Max length1
Mean length1
Min length1
Contains charsTrue
Contains digitsFalse
Contains spacesFalse
Contains non-wordsFalse

Sport
Categorical

Distinct count55
Unique (%)0.2%
Missing (%)0.0%
Missing (n)0
Athletics
 
3648
Swimming
 
2486
Rowing
 
2104
Other values (52)
21943
ValueCountFrequency (%) 
Athletics 3648 12.1%
 
Swimming 2486 8.2%
 
Rowing 2104 7.0%
 
Ice Hockey 1301 4.3%
 
Hockey 1168 3.9%
 
Gymnastics 1161 3.8%
 
Fencing 1109 3.7%
 
Football 1084 3.6%
 
Canoeing 1041 3.4%
 
Basketball 1000 3.3%
 
Other values (45) 14079 46.6%
 
Max length25
Mean length9.165037606
Min length4
Contains charsTrue
Contains digitsFalse
Contains spacesTrue
Contains non-wordsTrue

Weight
Numeric

Distinct count129
Unique (%)0.4%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean73.75033962
Minimum28
Maximum182
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum28
5-th percentile52
Q163
Median73
Q383
95-th percentile100
Maximum182
Range154
Interquartile range20

Descriptive statistics

Standard deviation15.00432874
Coef of variation0.2034475884
Kurtosis1.550700274
Mean73.75033962
MAD11.70538571
Skewness0.689245348
Sum2225859
Variance225.129881
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=50)
Histogram
Histogram with variable size bins (bins=[ 28. 32.5 38.5 41.5 46.5 ... 125.5 129.5 130.5 146.5 182. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
70 1258 4.2%
 
75 1173 3.9%
 
68 949 3.1%
 
80 930 3.1%
 
73 911 3.0%
 
60 908 3.0%
 
72 892 3.0%
 
65 852 2.8%
 
78 776 2.6%
 
64 763 2.5%
 
Other values (119) 20769 68.8%
 

Minimum 5 values

ValueCountFrequency (%) 
28 2 < 0.1%
 
30 5 < 0.1%
 
31 1 < 0.1%
 
32 3 < 0.1%
 
33 9 < 0.1%
 

Maximum 5 values

ValueCountFrequency (%) 
182 1 < 0.1%
 
175 1 < 0.1%
 
170 2 < 0.1%
 
167 1 < 0.1%
 
163 2 < 0.1%
 

Year
Numeric

Distinct count35
Unique (%)0.1%
Missing (%)0.0%
Missing (n)0
Infinite (%)0.0%
Infinite (n)0
Mean1988.005964
Minimum1896
Maximum2016
Zeros (%)0.0%
Mini histogram

Quantile statistics

Minimum1896
5-th percentile1948
Q11976
Median1992
Q32006
95-th percentile2016
Maximum2016
Range120
Interquartile range30

Descriptive statistics

Standard deviation22.71845057
Coef of variation0.01142775775
Kurtosis1.693381115
Mean1988.005964
MAD17.79612648
Skewness-1.197529559
Sum60000008
Variance516.1279962
Memory size235.9 KiB
Histogram
Histogram with fixed size bins (bins=35)
Histogram
Histogram with variable size bins (bins=[1896. 1902. 1905. 1910. 1922. ... 2009. 2011. 2013. 2015. 2016.], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2008 2035 6.7%
 
2016 2014 6.7%
 
2004 2000 6.6%
 
2000 1993 6.6%
 
2012 1915 6.3%
 
1992 1834 6.1%
 
1988 1827 6.1%
 
1996 1717 5.7%
 
1984 1683 5.6%
 
1980 1572 5.2%
 
Other values (25) 11591 38.4%
 

Minimum 5 values

ValueCountFrequency (%) 
1896 20 0.1%
 
1900 38 0.1%
 
1904 59 0.2%
 
1906 69 0.2%
 
1908 134 0.4%
 

Maximum 5 values

ValueCountFrequency (%) 
2016 2014 6.7%
 
2014 570 1.9%
 
2012 1915 6.3%
 
2010 515 1.7%
 
2008 2035 6.7%
 

Correlations

Missing values

Sample

First rows

Agedf_indexEventGamesHeightMedalNOCSexSportWeightYear
021125750Ice Hockey Men's Ice Hockey1992 Winter177GoldEUNMIce Hockey801992
12847707Water Polo Men's Water Polo1992 Summer180GoldITAMWater Polo721992
217247453Athletics Women's 4 x 100 metres Relay1972 Summer167BronzeCUBFAthletics541972
32596724Rowing Men's Coxed Eights1972 Summer191SilverUSAMRowing901972
4296158Rowing Men's Coxed Eights1964 Summer186GoldUSAMRowing911964
525191781Handball Men's Handball2016 Summer190SilverFRAMHandball922016
621147661Athletics Men's 4 x 400 metres Relay1980 Summer180BronzeITAMAthletics731980
726157511Canoeing Women's Kayak Fours, 500 metres1992 Summer176GoldHUNFCanoeing661992
821122955Wrestling Men's Super-Heavyweight, Greco-Roman1976 Summer193GoldURSMWrestling1151976
922159127Athletics Men's Javelin Throw1952 Summer179SilverUSAMAthletics771952

Last rows

Agedf_indexEventGamesHeightMedalNOCSexSportWeightYear
3017128240574Gymnastics Men's Horizontal Bar1964 Summer170SilverURSMGymnastics701964
301721776428Athletics Women's 4 x 100 metres Relay1984 Summer167SilverCANFAthletics591984
301731980162Athletics Men's 4 x 100 metres Relay1976 Summer173GoldUSAMAthletics671976
3017428270111Volleyball Women's Volleyball2016 Summer186SilverSRBFVolleyball722016
3017528161542Fencing Women's Foil, Team1984 Summer169SilverROUFFencing581984
301762318381Cross Country Skiing Women's 5/10 kilometres Pursuit1992 Winter158SilverITAFCross Country Skiing451992
3017724141646Fencing Women's epee, Team2004 Summer174GoldRUSFFencing622004
301782398983Fencing Men's Sabre, Individual1960 Summer183SilverHUNMFencing751960
3017926192528Fencing Men's Sabre, Team2000 Summer178GoldRUSMFencing782000
3018023130584Handball Men's Handball2004 Summer196GoldCROMHandball992004